Evaluation of noisy speech recognition based on noise reduction and acoustic model adaptation on the Aurora2 tasks
نویسندگان
چکیده
In this paper, we have evaluated a noisy speech recognition method based on noise reduction and acoustic model adaptation, on the AURORA2 tasks. For noise reduction method, we employed two noise reduction methods. One is an Adaptive Sub-Band Spectral Subtraction (ASBSS) method which can vary noise subtraction rate according to SNR in frequency bands at each frame. The other is a Kalman filtering estimation method which re-estimates the accurate speech spectra from speech spectra estimated by ASBSS. We estimated the accurate speech spectra by combining these methods. Usually, the noise reduction method has a problem that it deteriorates the recognition rate because of spectral distortion by residual noise of noise reduction and over estimation. To solve the problem in noise reduction method, adaptation of the acoustic models is employed by using unsupervised MLLR adaptation to spectral distortion. In evaluation on the AURORA2 tasks, our method showed the significant improvement in recognition accuracy both clean training condition and multi training condition.
منابع مشابه
Combination of temporal domain SVD based speech enhancement and GMM based speech estimation for ASR in noise - evaluation on the AURORA2 task -
In this paper, we propose a noise robust speech recognition method by combination of temporal domain singular value decomposition(SVD) based speech enhancement and Gaussian mixture model(GMM) based speech estimation. The bottleneck of GMM based approach is a noise estimation problem. For this noise estimation problem, we incorporated the adaptive noise estimation in GMM based approach. Furtherm...
متن کاملStochastic vector mapping-based feature enhancement using prior-models and model adaptation for noisy speech recognition
This paper presents an approach to feature enhancement for noisy speech recognition. Three prior-models are introduced to characterize clean speech, noise and noisy speech, respectively. Sequential noise estimation is employed for prior-model construction based on noise-normalized stochastic vector mapping. Therefore, feature enhancement can work without stereo training data and manual tagging ...
متن کاملHmm Composition-based Rapid Model Ad Gmm Adaptation Evaluation O
When a speech recognition system is used in a real environment, its recognition performance is affected by the surrounding noise. Most types of additional noise as well as SNRs are difficult to predict, so there is a mismatch between the training and test data. We need a method to deal with this problem. In this paper, we propose an HMM compositionbased model adaptation method with a priori noi...
متن کاملA New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain
Quality of speech signal significantly reduces in the presence of environmental noise signals and leads to the imperfect performance of hearing aid devices, automatic speech recognition systems, and mobile phones. In this paper, the single channel speech enhancement of the corrupted signals by the additive noise signals is considered. A dictionary-based algorithm is proposed to train the speech...
متن کاملAn Information-Theoretic Discussion of Convolutional Bottleneck Features for Robust Speech Recognition
Convolutional Neural Networks (CNNs) have been shown their performance in speech recognition systems for extracting features, and also acoustic modeling. In addition, CNNs have been used for robust speech recognition and competitive results have been reported. Convolutive Bottleneck Network (CBN) is a kind of CNNs which has a bottleneck layer among its fully connected layers. The bottleneck fea...
متن کامل